PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Neem_19393_f_1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Sapindales; Meliaceae; Azadirachta
Family HD-ZIP
Protein Properties Length: 770aa    MW: 85517.2 Da    PI: 5.4447
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Neem_19393_f_1genomeNGDView Nucleic Acid
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox67.22.2e-2188143156
                     TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
        Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                     +++ +++t++q++e+e+lF+++++p+ ++r +L+++lgL+ rqVk+WFqNrR+++k
  Neem_19393_f_1  88 KKRYHRHTAHQIQEMEALFKECPHPDDKQRMKLSQELGLKPRQVKFWFQNRRTQMK 143
                     688999***********************************************998 PP

2START167.68.2e-532855092206
                     HHHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS..............SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EE CS
           START   2 laeeaaqelvkkalaeepgWvkssesengdevlqkfeeskv.............dsgealrasgvvdmvlallveellddkeqWdetla....ka 79 
                     la + ++el+k+  a+ep+Wv+s   eng+evl  +e ++              +++ea r+++vv+m++ +lv ++ld + +W e ++    +a
  Neem_19393_f_1 285 LAISSMNELIKMCHANEPLWVRSN--ENGKEVLNLEEHARIfpwplnlkqhsseFRTEATRDTAVVIMNSITLVDTFLDAN-KWMELFPsivaRA 376
                     77899*******************..*******999888889***************************************.******99999** PP

                     EEEEEECTT.....EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE..TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEE CS
           START  80 etlevissg.....galqlmvaelqalsplvp.RdfvfvRyirq.lgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwv 167
                     +t++vi sg     g lqlm+aelq+ splvp R+ +f+Ry++q  ++g+w+ivd  +ds  ++  ++s+   +++pSg++i++++ng+s+vtwv
  Neem_19393_f_1 377 KTIQVIASGvsgasGSLQLMYAELQVVSPLVPtRETYFLRYCQQnVEEGTWAIVDFPIDSFHENI-QPSFPLYRRRPSGCVIQDMPNGYSRVTWV 470
                     ********************************************99*************999998.67777777********************* PP

                     E-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
           START 168 ehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                     eh++++++ +h+++ ++v sg+a+ga++w++ lqrqce+
  Neem_19393_f_1 471 EHAEMEEKPVHQIFSQFVYSGMAFGANRWLSVLQRQCER 509
                     *************************************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466892.84E-2078146IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.601.3E-2182151IPR009057Homeodomain-like
PROSITE profilePS5007117.28185145IPR001356Homeobox domain
SMARTSM003893.7E-2086149IPR001356Homeobox domain
CDDcd000864.90E-1988146No hitNo description
PfamPF000467.4E-1988143IPR001356Homeobox domain
PROSITE patternPS000270120143IPR017970Homeobox, conserved site
PROSITE profilePS5084846.142275512IPR002913START domain
SuperFamilySSF559613.19E-33276511No hitNo description
CDDcd088752.88E-119279508No hitNo description
SMARTSM002346.1E-36284509IPR002913START domain
PfamPF018521.1E-44285509IPR002913START domain
Gene3DG3DSA:3.30.530.202.4E-5359493IPR023393START-like domain
SuperFamilySSF559613.41E-11546738No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0048497Biological Processmaintenance of floral organ identity
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 770 aa     Download sequence    Send to blast
MYGDCQVMSN MGGGNIVSSE TLFSSPSTVQ NPNFNFMPFH SFPHIAPKEE NGLLMRSKED  60
MESGSGSEQF EEKSGNELES SEQQQPKKKR YHRHTAHQIQ EMEALFKECP HPDDKQRMKL  120
SQELGLKPRQ VKFWFQNRRT QMKAQQDRAD NAILRAENET LKNENYRLQA ELRSVICPNC  180
GGPAILGGIS FEDLRLENAR LREELERVCC LASRYSGRPV QAMAPAPPMI PPSLDLDMNI  240
YSRHFAVPMA TCTDIMPMPM LPETSAFPET GLILMEEEKS IAMELAISSM NELIKMCHAN  300
EPLWVRSNEN GKEVLNLEEH ARIFPWPLNL KQHSSEFRTE ATRDTAVVIM NSITLVDTFL  360
DANKWMELFP SIVARAKTIQ VIASGVSGAS GSLQLMYAEL QVVSPLVPTR ETYFLRYCQQ  420
NVEEGTWAIV DFPIDSFHEN IQPSFPLYRR RPSGCVIQDM PNGYSRVTWV EHAEMEEKPV  480
HQIFSQFVYS GMAFGANRWL SVLQRQCERI ASLMARNIAD LGVIPSPEAR KNLMRLAQRM  540
IGTFCVNIST SSGQSWTALS DSCADTVRIT TRKITEPGQP NGVILCAAST TWLPYPHYQV  600
FDLLRDERRR SQVASNSSQH VELMLQESCT DQCGSLVVYT TIDVDSIQLA MSGEDPSCIP  660
LLPLGFVINP VEPIKETTSG DGNSISSSEE AAAANGKNSG CLLTVGLQVL ASTIPSAKLN  720
LSSVNAINNH LCNTVNQITA ALSSGSGTSC PDNNGSVIGT CTQPNGAPKQ
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAM4279442e-44AM427944.2 Vitis vinifera contig VV78X076481.6, whole genome shotgun sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006471522.10.0PREDICTED: homeobox-leucine zipper protein HDG5 isoform X1
SwissprotQ9FJS20.0HDG5_ARATH; Homeobox-leucine zipper protein HDG5
TrEMBLV4T3K10.0V4T3K1_9R
STRINGPOPTR_0003s09470.10.0(Populus trichocarpa)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM43562548
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G46880.10.0homeobox-7